Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 104768 |
| Missing cells | 292984 |
| Missing cells (%) | 14.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 16.0 MiB |
| Average record size in memory | 160.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 7 |
| DateTime | 1 |
| Unsupported | 1 |
Store is highly overall correlated with PromoInterval | High correlation |
CompetitionOpenSinceMonth is highly overall correlated with PromoInterval | High correlation |
CompetitionOpenSinceYear is highly overall correlated with PromoInterval | High correlation |
Promo2SinceWeek is highly overall correlated with StoreType and 2 other fields | High correlation |
Promo2SinceYear is highly overall correlated with StoreType and 1 other fields | High correlation |
DayOfWeek is highly overall correlated with Open | High correlation |
Sales is highly overall correlated with Customers and 3 other fields | High correlation |
Customers is highly overall correlated with Sales and 2 other fields | High correlation |
Normalized_Sales is highly overall correlated with Sales and 3 other fields | High correlation |
StoreType is highly overall correlated with Promo2SinceWeek and 1 other fields | High correlation |
Promo2 is highly overall correlated with Promo2SinceWeek and 2 other fields | High correlation |
PromoInterval is highly overall correlated with Store and 4 other fields | High correlation |
Open is highly overall correlated with DayOfWeek and 3 other fields | High correlation |
Promo is highly overall correlated with Sales and 1 other fields | High correlation |
CompetitionOpenSinceMonth has 30902 (29.5%) missing values | Missing |
CompetitionOpenSinceYear has 30902 (29.5%) missing values | Missing |
Promo2SinceWeek has 77060 (73.6%) missing values | Missing |
Promo2SinceYear has 77060 (73.6%) missing values | Missing |
PromoInterval has 77060 (73.6%) missing values | Missing |
StateHoliday is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Sales has 16585 (15.8%) zeros | Zeros |
Customers has 16585 (15.8%) zeros | Zeros |
Normalized_Sales has 16585 (15.8%) zeros | Zeros |
Reproduction
| Analysis started | 2023-11-14 12:09:51.034638 |
|---|---|
| Analysis finished | 2023-11-14 12:10:08.132053 |
| Duration | 17.1 seconds |
| Software version | ydata-profiling vv4.6.1 |
| Download configuration | config.json |
Store
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 112 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 568.68557 |
| Minimum | 4 |
|---|---|
| Maximum | 1114 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 64 |
| Q1 | 368 |
| median | 544 |
| Q3 | 787 |
| 95-th percentile | 1089 |
| Maximum | 1114 |
| Range | 1110 |
| Interquartile range (IQR) | 419 |
Descriptive statistics
| Standard deviation | 290.14323 |
|---|---|
| Coefficient of variation (CV) | 0.51019973 |
| Kurtosis | -0.77179221 |
| Mean | 568.68557 |
| Median Absolute Deviation (MAD) | 211 |
| Skewness | 0.07022797 |
| Sum | 59580050 |
| Variance | 84183.091 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 4 | 942 | 0.9% |
| 679 | 942 | 0.9% |
| 755 | 942 | 0.9% |
| 733 | 942 | 0.9% |
| 729 | 942 | 0.9% |
| 726 | 942 | 0.9% |
| 722 | 942 | 0.9% |
| 713 | 942 | 0.9% |
| 709 | 942 | 0.9% |
| 704 | 942 | 0.9% |
| Other values (102) | 95348 |
| Value | Count | Frequency (%) |
| 4 | 942 | |
| 25 | 942 | |
| 35 | 942 | |
| 42 | 942 | |
| 57 | 942 | |
| 64 | 942 | |
| 84 | 942 | |
| 104 | 942 | |
| 125 | 942 | |
| 157 | 942 |
| Value | Count | Frequency (%) |
| 1114 | 942 | |
| 1112 | 942 | |
| 1101 | 942 | |
| 1097 | 942 | |
| 1092 | 758 | |
| 1089 | 942 | |
| 1075 | 942 | |
| 1066 | 942 | |
| 1033 | 942 | |
| 1027 | 758 |
StoreType
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| a | |
|---|---|
| d | |
| c | |
| b |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 104768 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | c |
|---|---|
| 2nd row | c |
| 3rd row | c |
| 4th row | c |
| 5th row | c |
Common Values
| Value | Count | Frequency (%) |
| a | 61620 | |
| d | 20540 | 19.6% |
| c | 14130 | 13.5% |
| b | 8478 | 8.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 61620 | |
| d | 20540 | 19.6% |
| c | 14130 | 13.5% |
| b | 8478 | 8.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 61620 | |
| d | 20540 | 19.6% |
| c | 14130 | 13.5% |
| b | 8478 | 8.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 104768 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 61620 | |
| d | 20540 | 19.6% |
| c | 14130 | 13.5% |
| b | 8478 | 8.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 104768 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 61620 | |
| d | 20540 | 19.6% |
| c | 14130 | 13.5% |
| b | 8478 | 8.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104768 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 61620 | |
| d | 20540 | 19.6% |
| c | 14130 | 13.5% |
| b | 8478 | 8.1% |
Assortment
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| c | |
|---|---|
| a | |
| b | 3768 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 104768 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | c |
|---|---|
| 2nd row | c |
| 3rd row | c |
| 4th row | c |
| 5th row | c |
Common Values
| Value | Count | Frequency (%) |
| c | 61620 | |
| a | 39380 | |
| b | 3768 | 3.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| c | 61620 | |
| a | 39380 | |
| b | 3768 | 3.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 61620 | |
| a | 39380 | |
| b | 3768 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 104768 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 61620 | |
| a | 39380 | |
| b | 3768 | 3.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 104768 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 61620 | |
| a | 39380 | |
| b | 3768 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104768 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 61620 | |
| a | 39380 | |
| b | 3768 | 3.6% |
CompetitionDistance
Real number (ℝ)
| Distinct | 95 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4021.9645 |
| Minimum | 40 |
|---|---|
| Maximum | 40540 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 40 |
|---|---|
| 5-th percentile | 60 |
| Q1 | 285 |
| median | 1210 |
| Q3 | 4060 |
| 95-th percentile | 22330 |
| Maximum | 40540 |
| Range | 40500 |
| Interquartile range (IQR) | 3775 |
Descriptive statistics
| Standard deviation | 7051.1133 |
|---|---|
| Coefficient of variation (CV) | 1.7531515 |
| Kurtosis | 8.6841229 |
| Mean | 4021.9645 |
| Median Absolute Deviation (MAD) | 1040 |
| Skewness | 2.8546501 |
| Sum | 4.2137318 × 108 |
| Variance | 49718199 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 3768 | 3.6% |
| 220 | 2826 | 2.7% |
| 250 | 2826 | 2.7% |
| 90 | 2826 | 2.7% |
| 210 | 2826 | 2.7% |
| 350 | 1884 | 1.8% |
| 580 | 1884 | 1.8% |
| 1910 | 1884 | 1.8% |
| 140 | 1884 | 1.8% |
| 120 | 1884 | 1.8% |
| Other values (85) | 80276 |
| Value | Count | Frequency (%) |
| 40 | 942 | 0.9% |
| 50 | 3768 | |
| 60 | 942 | 0.9% |
| 80 | 942 | 0.9% |
| 90 | 2826 | |
| 120 | 1884 | |
| 140 | 1884 | |
| 150 | 942 | 0.9% |
| 170 | 942 | 0.9% |
| 190 | 1700 |
| Value | Count | Frequency (%) |
| 40540 | 942 | |
| 33060 | 942 | |
| 23620 | 942 | |
| 23130 | 942 | |
| 22560 | 942 | |
| 22330 | 942 | |
| 20620 | 942 | |
| 20390 | 942 | |
| 16490 | 942 | |
| 15340 | 942 |
CompetitionOpenSinceMonth
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 30902 |
| Missing (%) | 29.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.775729 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.1127001 |
|---|---|
| Coefficient of variation (CV) | 0.45938969 |
| Kurtosis | -1.1413135 |
| Mean | 6.775729 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.059514846 |
| Sum | 500496 |
| Variance | 9.6889017 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 10362 | 9.9% |
| 9 | 9420 | 9.0% |
| 6 | 8294 | 7.9% |
| 8 | 6594 | 6.3% |
| 5 | 6594 | 6.3% |
| 3 | 6594 | 6.3% |
| 10 | 5652 | 5.4% |
| 12 | 5652 | 5.4% |
| 11 | 5468 | 5.2% |
| 2 | 3768 | 3.6% |
| Other values (2) | 5468 | 5.2% |
| (Missing) | 30902 |
| Value | Count | Frequency (%) |
| 1 | 1884 | 1.8% |
| 2 | 3768 | 3.6% |
| 3 | 6594 | |
| 4 | 10362 | |
| 5 | 6594 | |
| 6 | 8294 | |
| 7 | 3584 | 3.4% |
| 8 | 6594 | |
| 9 | 9420 | |
| 10 | 5652 |
| Value | Count | Frequency (%) |
| 12 | 5652 | |
| 11 | 5468 | |
| 10 | 5652 | |
| 9 | 9420 | |
| 8 | 6594 | |
| 7 | 3584 | 3.4% |
| 6 | 8294 | |
| 5 | 6594 | |
| 4 | 10362 | |
| 3 | 6594 |
CompetitionOpenSinceYear
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 30902 |
| Missing (%) | 29.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2008.6601 |
| Minimum | 1999 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1999 |
|---|---|
| 5-th percentile | 2001 |
| Q1 | 2005 |
| median | 2009 |
| Q3 | 2013 |
| 95-th percentile | 2014 |
| Maximum | 2015 |
| Range | 16 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 4.1633051 |
|---|---|
| Coefficient of variation (CV) | 0.0020726778 |
| Kurtosis | -0.82778341 |
| Mean | 2008.6601 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.39051738 |
| Sum | 1.4837168 × 108 |
| Variance | 17.333109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2013 | 9420 | 9.0% |
| 2009 | 8478 | 8.1% |
| 2014 | 7536 | 7.2% |
| 2005 | 7536 | 7.2% |
| 2006 | 6594 | 6.3% |
| 2012 | 5652 | 5.4% |
| 2010 | 4710 | 4.5% |
| 2008 | 4526 | 4.3% |
| 2011 | 3768 | 3.6% |
| 2003 | 2826 | 2.7% |
| Other values (7) | 12820 | |
| (Missing) | 30902 |
| Value | Count | Frequency (%) |
| 1999 | 942 | 0.9% |
| 2000 | 1700 | 1.6% |
| 2001 | 1884 | 1.8% |
| 2002 | 2826 | 2.7% |
| 2003 | 2826 | 2.7% |
| 2004 | 942 | 0.9% |
| 2005 | 7536 | |
| 2006 | 6594 | |
| 2007 | 2642 | 2.5% |
| 2008 | 4526 |
| Value | Count | Frequency (%) |
| 2015 | 1884 | 1.8% |
| 2014 | 7536 | |
| 2013 | 9420 | |
| 2012 | 5652 | |
| 2011 | 3768 | 3.6% |
| 2010 | 4710 | |
| 2009 | 8478 | |
| 2008 | 4526 | |
| 2007 | 2642 | 2.5% |
| 2006 | 6594 |
Promo2
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 104768 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 77060 | |
| 1 | 27708 | 26.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 77060 | |
| 1 | 27708 | 26.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 77060 | |
| 1 | 27708 | 26.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 104768 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 77060 | |
| 1 | 27708 | 26.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 104768 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 77060 | |
| 1 | 27708 | 26.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104768 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 77060 | |
| 1 | 27708 | 26.4% |
Promo2SinceWeek
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 77060 |
| Missing (%) | 73.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.435037 |
| Minimum | 1 |
|---|---|
| Maximum | 48 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 10 |
| median | 31 |
| Q3 | 40 |
| 95-th percentile | 48 |
| Maximum | 48 |
| Range | 47 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 15.696298 |
|---|---|
| Coefficient of variation (CV) | 0.59376872 |
| Kurtosis | -1.5283546 |
| Mean | 26.435037 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | -0.24610407 |
| Sum | 732462 |
| Variance | 246.37377 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40 | 5100 | 4.9% |
| 10 | 3768 | 3.6% |
| 37 | 2826 | 2.7% |
| 1 | 1884 | 1.8% |
| 5 | 1884 | 1.8% |
| 31 | 1884 | 1.8% |
| 45 | 1884 | 1.8% |
| 48 | 1884 | 1.8% |
| 14 | 942 | 0.9% |
| 39 | 942 | 0.9% |
| Other values (5) | 4710 | 4.5% |
| (Missing) | 77060 |
| Value | Count | Frequency (%) |
| 1 | 1884 | |
| 5 | 1884 | |
| 9 | 942 | 0.9% |
| 10 | 3768 | |
| 13 | 942 | 0.9% |
| 14 | 942 | 0.9% |
| 18 | 942 | 0.9% |
| 22 | 942 | 0.9% |
| 31 | 1884 | |
| 35 | 942 | 0.9% |
| Value | Count | Frequency (%) |
| 48 | 1884 | 1.8% |
| 45 | 1884 | 1.8% |
| 40 | 5100 | |
| 39 | 942 | 0.9% |
| 37 | 2826 | |
| 35 | 942 | 0.9% |
| 31 | 1884 | 1.8% |
| 22 | 942 | 0.9% |
| 18 | 942 | 0.9% |
| 14 | 942 | 0.9% |
Promo2SinceYear
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 77060 |
| Missing (%) | 73.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2011.7081 |
| Minimum | 2009 |
|---|---|
| Maximum | 2014 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 2009 |
|---|---|
| 5-th percentile | 2009 |
| Q1 | 2010 |
| median | 2012 |
| Q3 | 2014 |
| 95-th percentile | 2014 |
| Maximum | 2014 |
| Range | 5 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.8414341 |
|---|---|
| Coefficient of variation (CV) | 0.00091535851 |
| Kurtosis | -1.3372398 |
| Mean | 2011.7081 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.16803939 |
| Sum | 55740408 |
| Variance | 3.3908797 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 7168 | 6.8% |
| 2009 | 5652 | 5.4% |
| 2011 | 5468 | 5.2% |
| 2012 | 3768 | 3.6% |
| 2013 | 3768 | 3.6% |
| 2010 | 1884 | 1.8% |
| (Missing) | 77060 |
| Value | Count | Frequency (%) |
| 2009 | 5652 | |
| 2010 | 1884 | 1.8% |
| 2011 | 5468 | |
| 2012 | 3768 | |
| 2013 | 3768 | |
| 2014 | 7168 |
| Value | Count | Frequency (%) |
| 2014 | 7168 | |
| 2013 | 3768 | |
| 2012 | 3768 | |
| 2011 | 5468 | |
| 2010 | 1884 | 1.8% |
| 2009 | 5652 |
PromoInterval
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 77060 |
| Missing (%) | 73.6% |
| Memory size | 1.6 MiB |
| Jan,Apr,Jul,Oct | |
|---|---|
| Feb,May,Aug,Nov | |
| Mar,Jun,Sept,Dec |
Length
| Max length | 16 |
|---|---|
| Median length | 15 |
| Mean length | 15.169987 |
| Min length | 15 |
Characters and Unicode
| Total characters | 420330 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Jan,Apr,Jul,Oct |
|---|---|
| 2nd row | Jan,Apr,Jul,Oct |
| 3rd row | Jan,Apr,Jul,Oct |
| 4th row | Jan,Apr,Jul,Oct |
| 5th row | Jan,Apr,Jul,Oct |
Common Values
| Value | Count | Frequency (%) |
| Jan,Apr,Jul,Oct | 17346 | 16.6% |
| Feb,May,Aug,Nov | 5652 | 5.4% |
| Mar,Jun,Sept,Dec | 4710 | 4.5% |
| (Missing) | 77060 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| jan,apr,jul,oct | 17346 | |
| feb,may,aug,nov | 5652 | 20.4% |
| mar,jun,sept,dec | 4710 | 17.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| , | 83124 | |
| J | 39402 | 9.4% |
| u | 27708 | 6.6% |
| a | 27708 | 6.6% |
| A | 22998 | 5.5% |
| c | 22056 | 5.2% |
| t | 22056 | 5.2% |
| r | 22056 | 5.2% |
| p | 22056 | 5.2% |
| n | 22056 | 5.2% |
| Other values (13) | 109110 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 226374 | |
| Uppercase Letter | 110832 | |
| Other Punctuation | 83124 | 19.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 27708 | |
| a | 27708 | |
| c | 22056 | |
| t | 22056 | |
| r | 22056 | |
| p | 22056 | |
| n | 22056 | |
| l | 17346 | |
| e | 15072 | |
| b | 5652 | 2.5% |
| Other values (4) | 22608 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 39402 | |
| A | 22998 | |
| O | 17346 | |
| M | 10362 | 9.3% |
| F | 5652 | 5.1% |
| N | 5652 | 5.1% |
| S | 4710 | 4.2% |
| D | 4710 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 83124 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 337206 | |
| Common | 83124 | 19.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| J | 39402 | |
| u | 27708 | 8.2% |
| a | 27708 | 8.2% |
| A | 22998 | 6.8% |
| c | 22056 | 6.5% |
| t | 22056 | 6.5% |
| r | 22056 | 6.5% |
| p | 22056 | 6.5% |
| n | 22056 | 6.5% |
| l | 17346 | 5.1% |
| Other values (12) | 91764 |
Common
| Value | Count | Frequency (%) |
| , | 83124 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 420330 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| , | 83124 | |
| J | 39402 | 9.4% |
| u | 27708 | 6.6% |
| a | 27708 | 6.6% |
| A | 22998 | 5.5% |
| c | 22056 | 5.2% |
| t | 22056 | 5.2% |
| r | 22056 | 5.2% |
| p | 22056 | 5.2% |
| n | 22056 | 5.2% |
| Other values (13) | 109110 |
DayOfWeek
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.9979765 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.9973628 |
|---|---|
| Coefficient of variation (CV) | 0.49959344 |
| Kurtosis | -1.2469264 |
| Mean | 3.9979765 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.0020091688 |
| Sum | 418860 |
| Variance | 3.9894582 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 15016 | |
| 4 | 15016 | |
| 3 | 15012 | |
| 2 | 15012 | |
| 1 | 14904 | |
| 7 | 14904 | |
| 6 | 14904 |
| Value | Count | Frequency (%) |
| 1 | 14904 | |
| 2 | 15012 | |
| 3 | 15012 | |
| 4 | 15016 | |
| 5 | 15016 | |
| 6 | 14904 | |
| 7 | 14904 |
| Value | Count | Frequency (%) |
| 7 | 14904 | |
| 6 | 14904 | |
| 5 | 15016 | |
| 4 | 15016 | |
| 3 | 15012 | |
| 2 | 15012 | |
| 1 | 14904 |
Date
Date
| Distinct | 942 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| Minimum | 2013-01-01 00:00:00 |
|---|---|
| Maximum | 2015-07-31 00:00:00 |
Sales
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 17114 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10044.839 |
| Minimum | 0 |
|---|---|
| Maximum | 38722 |
| Zeros | 16585 |
| Zeros (%) | 15.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 7878 |
| median | 10324 |
| Q3 | 13238 |
| 95-th percentile | 19112 |
| Maximum | 38722 |
| Range | 38722 |
| Interquartile range (IQR) | 5360 |
Descriptive statistics
| Standard deviation | 5686.5633 |
|---|---|
| Coefficient of variation (CV) | 0.56611791 |
| Kurtosis | 0.33754372 |
| Mean | 10044.839 |
| Median Absolute Deviation (MAD) | 2642 |
| Skewness | -0.080041153 |
| Sum | 1.0523777 × 109 |
| Variance | 32337002 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16585 | 15.8% |
| 10428 | 26 | < 0.1% |
| 8946 | 25 | < 0.1% |
| 8311 | 24 | < 0.1% |
| 10425 | 24 | < 0.1% |
| 10385 | 24 | < 0.1% |
| 9501 | 24 | < 0.1% |
| 9169 | 24 | < 0.1% |
| 9662 | 24 | < 0.1% |
| 9189 | 23 | < 0.1% |
| Other values (17104) | 87965 |
| Value | Count | Frequency (%) |
| 0 | 16585 | |
| 1407 | 1 | < 0.1% |
| 1410 | 1 | < 0.1% |
| 2070 | 1 | < 0.1% |
| 2099 | 1 | < 0.1% |
| 2177 | 1 | < 0.1% |
| 2223 | 1 | < 0.1% |
| 2326 | 1 | < 0.1% |
| 2347 | 1 | < 0.1% |
| 2363 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 38722 | 1 | |
| 38484 | 1 | |
| 38367 | 1 | |
| 38037 | 1 | |
| 38025 | 1 | |
| 37646 | 1 | |
| 37403 | 1 | |
| 37376 | 1 | |
| 37122 | 1 | |
| 36417 | 1 |
Customers
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 3796 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1229.8038 |
| Minimum | 0 |
|---|---|
| Maximum | 7388 |
| Zeros | 16585 |
| Zeros (%) | 15.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 838 |
| median | 1177 |
| Q3 | 1601 |
| 95-th percentile | 2828 |
| Maximum | 7388 |
| Range | 7388 |
| Interquartile range (IQR) | 763 |
Descriptive statistics
| Standard deviation | 811.24939 |
|---|---|
| Coefficient of variation (CV) | 0.65965756 |
| Kurtosis | 1.0621201 |
| Mean | 1229.8038 |
| Median Absolute Deviation (MAD) | 376 |
| Skewness | 0.66055617 |
| Sum | 1.2884409 × 108 |
| Variance | 658125.57 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16585 | 15.8% |
| 1075 | 175 | 0.2% |
| 989 | 166 | 0.2% |
| 1070 | 159 | 0.2% |
| 1123 | 158 | 0.2% |
| 1113 | 157 | 0.1% |
| 1150 | 156 | 0.1% |
| 1032 | 154 | 0.1% |
| 1102 | 153 | 0.1% |
| 1145 | 152 | 0.1% |
| Other values (3786) | 86753 |
| Value | Count | Frequency (%) |
| 0 | 16585 | |
| 156 | 1 | < 0.1% |
| 197 | 1 | < 0.1% |
| 237 | 1 | < 0.1% |
| 267 | 1 | < 0.1% |
| 274 | 1 | < 0.1% |
| 276 | 1 | < 0.1% |
| 280 | 1 | < 0.1% |
| 282 | 1 | < 0.1% |
| 301 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7388 | 1 | |
| 5494 | 1 | |
| 5458 | 1 | |
| 5387 | 1 | |
| 5297 | 1 | |
| 5192 | 1 | |
| 5152 | 1 | |
| 5145 | 1 | |
| 5132 | 1 | |
| 5112 | 1 |
Open
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 1 | |
|---|---|
| 0 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 104768 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 88189 | |
| 0 | 16579 | 15.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 88189 | |
| 0 | 16579 | 15.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 88189 | |
| 0 | 16579 | 15.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 104768 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 88189 | |
| 0 | 16579 | 15.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 104768 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 88189 | |
| 0 | 16579 | 15.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104768 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 88189 | |
| 0 | 16579 | 15.8% |
Promo
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 104768 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 64744 | |
| 1 | 40024 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 64744 | |
| 1 | 40024 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 64744 | |
| 1 | 40024 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 104768 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 64744 | |
| 1 | 40024 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 104768 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 64744 | |
| 1 | 40024 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104768 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 64744 | |
| 1 | 40024 |
StateHoliday
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
SchoolHoliday
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.6 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 104768 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 85730 | |
| 1 | 19038 | 18.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 85730 | |
| 1 | 19038 | 18.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 85730 | |
| 1 | 19038 | 18.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 104768 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 85730 | |
| 1 | 19038 | 18.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 104768 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 85730 | |
| 1 | 19038 | 18.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104768 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 85730 | |
| 1 | 19038 | 18.2% |
Normalized_Sales
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 17114 |
|---|---|
| Distinct (%) | 16.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2594091 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 16585 |
| Zeros (%) | 15.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.20345024 |
| median | 0.26661846 |
| Q3 | 0.34187284 |
| 95-th percentile | 0.49356955 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.1384226 |
Descriptive statistics
| Standard deviation | 0.14685614 |
|---|---|
| Coefficient of variation (CV) | 0.56611791 |
| Kurtosis | 0.33754372 |
| Mean | 0.2594091 |
| Median Absolute Deviation (MAD) | 0.068229947 |
| Skewness | -0.080041153 |
| Sum | 27177.772 |
| Variance | 0.021566724 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 16585 | 15.8% |
| 0.2693042715 | 26 | < 0.1% |
| 0.231031455 | 25 | < 0.1% |
| 0.2146325087 | 24 | < 0.1% |
| 0.2692267961 | 24 | < 0.1% |
| 0.2681937916 | 24 | < 0.1% |
| 0.2453643923 | 24 | < 0.1% |
| 0.236790455 | 24 | < 0.1% |
| 0.2495222354 | 24 | < 0.1% |
| 0.2373069573 | 23 | < 0.1% |
| Other values (17104) | 87965 |
| Value | Count | Frequency (%) |
| 0 | 16585 | |
| 0.03633593306 | 1 | < 0.1% |
| 0.0364134084 | 1 | < 0.1% |
| 0.05345798254 | 1 | < 0.1% |
| 0.0542069108 | 1 | < 0.1% |
| 0.05622126956 | 1 | < 0.1% |
| 0.05740922473 | 1 | < 0.1% |
| 0.0600692113 | 1 | < 0.1% |
| 0.06061153866 | 1 | < 0.1% |
| 0.06102474046 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 0.9938536233 | 1 | |
| 0.9908320851 | 1 | |
| 0.982309798 | 1 | |
| 0.9819998967 | 1 | |
| 0.9722121791 | 1 | |
| 0.9659366768 | 1 | |
| 0.9652393988 | 1 | |
| 0.9586798203 | 1 | |
| 0.9404731161 | 1 |
| Store | CompetitionDistance | CompetitionOpenSinceMonth | CompetitionOpenSinceYear | Promo2SinceWeek | Promo2SinceYear | DayOfWeek | Sales | Customers | Normalized_Sales | StoreType | Assortment | Promo2 | PromoInterval | Open | Promo | SchoolHoliday | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Store | 1.000 | -0.053 | -0.151 | -0.062 | 0.130 | 0.300 | 0.000 | 0.020 | 0.007 | 0.020 | 0.270 | 0.231 | 0.272 | 0.567 | 0.041 | 0.000 | 0.000 |
| CompetitionDistance | -0.053 | 1.000 | -0.114 | 0.038 | -0.133 | -0.073 | -0.000 | -0.126 | -0.365 | -0.126 | 0.273 | 0.209 | 0.219 | 0.226 | 0.019 | 0.000 | 0.000 |
| CompetitionOpenSinceMonth | -0.151 | -0.114 | 1.000 | 0.014 | -0.290 | 0.024 | 0.000 | 0.015 | -0.037 | 0.015 | 0.379 | 0.395 | 0.156 | 0.624 | 0.044 | 0.000 | 0.000 |
| CompetitionOpenSinceYear | -0.062 | 0.038 | 0.014 | 1.000 | 0.092 | -0.047 | -0.000 | 0.065 | 0.048 | 0.065 | 0.280 | 0.416 | 0.426 | 0.834 | 0.035 | 0.000 | 0.000 |
| Promo2SinceWeek | 0.130 | -0.133 | -0.290 | 0.092 | 1.000 | -0.434 | 0.000 | 0.078 | 0.158 | 0.078 | 0.590 | 0.407 | 1.000 | 0.629 | 0.066 | 0.000 | 0.000 |
| Promo2SinceYear | 0.300 | -0.073 | 0.024 | -0.047 | -0.434 | 1.000 | 0.000 | 0.000 | -0.005 | 0.000 | 0.512 | 0.389 | 1.000 | 0.362 | 0.052 | 0.000 | 0.000 |
| DayOfWeek | 0.000 | -0.000 | 0.000 | -0.000 | 0.000 | 0.000 | 1.000 | -0.440 | -0.379 | -0.440 | 0.000 | 0.000 | 0.000 | 0.000 | 0.851 | 0.496 | 0.273 |
| Sales | 0.020 | -0.126 | 0.015 | 0.065 | 0.078 | 0.000 | -0.440 | 1.000 | 0.835 | 1.000 | 0.136 | 0.081 | 0.100 | 0.099 | 0.993 | 0.501 | 0.098 |
| Customers | 0.007 | -0.365 | -0.037 | 0.048 | 0.158 | -0.005 | -0.379 | 0.835 | 1.000 | 0.835 | 0.392 | 0.361 | 0.099 | 0.184 | 0.854 | 0.347 | 0.088 |
| Normalized_Sales | 0.020 | -0.126 | 0.015 | 0.065 | 0.078 | 0.000 | -0.440 | 1.000 | 0.835 | 1.000 | 0.136 | 0.081 | 0.100 | 0.099 | 0.993 | 0.501 | 0.098 |
| StoreType | 0.270 | 0.273 | 0.379 | 0.280 | 0.590 | 0.512 | 0.000 | 0.136 | 0.392 | 0.136 | 1.000 | 0.482 | 0.129 | 0.461 | 0.128 | 0.000 | 0.000 |
| Assortment | 0.231 | 0.209 | 0.395 | 0.416 | 0.407 | 0.389 | 0.000 | 0.081 | 0.361 | 0.081 | 0.482 | 1.000 | 0.013 | 0.140 | 0.084 | 0.000 | 0.001 |
| Promo2 | 0.272 | 0.219 | 0.156 | 0.426 | 1.000 | 1.000 | 0.000 | 0.100 | 0.099 | 0.100 | 0.129 | 0.013 | 1.000 | 1.000 | 0.005 | 0.000 | 0.003 |
| PromoInterval | 0.567 | 0.226 | 0.624 | 0.834 | 0.629 | 0.362 | 0.000 | 0.099 | 0.184 | 0.099 | 0.461 | 0.140 | 1.000 | 1.000 | 0.022 | 0.000 | 0.007 |
| Open | 0.041 | 0.019 | 0.044 | 0.035 | 0.066 | 0.052 | 0.851 | 0.993 | 0.854 | 0.993 | 0.128 | 0.084 | 0.005 | 0.022 | 1.000 | 0.290 | 0.094 |
| Promo | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.496 | 0.501 | 0.347 | 0.501 | 0.000 | 0.000 | 0.000 | 0.000 | 0.290 | 1.000 | 0.071 |
| SchoolHoliday | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.273 | 0.098 | 0.088 | 0.098 | 0.000 | 0.001 | 0.003 | 0.007 | 0.094 | 0.071 | 1.000 |
| Store | StoreType | Assortment | CompetitionDistance | CompetitionOpenSinceMonth | CompetitionOpenSinceYear | Promo2 | Promo2SinceWeek | Promo2SinceYear | PromoInterval | DayOfWeek | Date | Sales | Customers | Open | Promo | StateHoliday | SchoolHoliday | Normalized_Sales | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2826 | 4 | c | c | 620.0 | 9.0 | 2009.0 | 0 | NaN | NaN | NaN | 5 | 2015-07-31 | 13995 | 1498 | 1 | 1 | 0 | 1 | 0.361422 |
| 2827 | 4 | c | c | 620.0 | 9.0 | 2009.0 | 0 | NaN | NaN | NaN | 4 | 2015-07-30 | 10387 | 1276 | 1 | 1 | 0 | 1 | 0.268245 |
| 2828 | 4 | c | c | 620.0 | 9.0 | 2009.0 | 0 | NaN | NaN | NaN | 3 | 2015-07-29 | 10514 | 1258 | 1 | 1 | 0 | 1 | 0.271525 |
| 2829 | 4 | c | c | 620.0 | 9.0 | 2009.0 | 0 | NaN | NaN | NaN | 2 | 2015-07-28 | 10275 | 1191 | 1 | 1 | 0 | 1 | 0.265353 |
| 2830 | 4 | c | c | 620.0 | 9.0 | 2009.0 | 0 | NaN | NaN | NaN | 1 | 2015-07-27 | 11812 | 1379 | 1 | 1 | 0 | 1 | 0.305046 |
| 2831 | 4 | c | c | 620.0 | 9.0 | 2009.0 | 0 | NaN | NaN | NaN | 7 | 2015-07-26 | 0 | 0 | 0 | 0 | 0 | 0 | 0.000000 |
| 2832 | 4 | c | c | 620.0 | 9.0 | 2009.0 | 0 | NaN | NaN | NaN | 6 | 2015-07-25 | 9322 | 1219 | 1 | 0 | 0 | 0 | 0.240742 |
| 2833 | 4 | c | c | 620.0 | 9.0 | 2009.0 | 0 | NaN | NaN | NaN | 5 | 2015-07-24 | 8322 | 1108 | 1 | 0 | 0 | 1 | 0.214917 |
| 2834 | 4 | c | c | 620.0 | 9.0 | 2009.0 | 0 | NaN | NaN | NaN | 4 | 2015-07-23 | 7286 | 1101 | 1 | 0 | 0 | 1 | 0.188162 |
| 2835 | 4 | c | c | 620.0 | 9.0 | 2009.0 | 0 | NaN | NaN | NaN | 3 | 2015-07-22 | 8503 | 1108 | 1 | 0 | 0 | 1 | 0.219591 |
| Store | StoreType | Assortment | CompetitionDistance | CompetitionOpenSinceMonth | CompetitionOpenSinceYear | Promo2 | Promo2SinceWeek | Promo2SinceYear | PromoInterval | DayOfWeek | Date | Sales | Customers | Open | Promo | StateHoliday | SchoolHoliday | Normalized_Sales | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1016257 | 1114 | a | c | 870.0 | NaN | NaN | 0 | NaN | NaN | NaN | 4 | 2013-01-10 | 18075 | 2641 | 1 | 1 | 0 | 0 | 0.466789 |
| 1016258 | 1114 | a | c | 870.0 | NaN | NaN | 0 | NaN | NaN | NaN | 3 | 2013-01-09 | 17073 | 2481 | 1 | 1 | 0 | 0 | 0.440912 |
| 1016259 | 1114 | a | c | 870.0 | NaN | NaN | 0 | NaN | NaN | NaN | 2 | 2013-01-08 | 18816 | 2588 | 1 | 1 | 0 | 0 | 0.485925 |
| 1016260 | 1114 | a | c | 870.0 | NaN | NaN | 0 | NaN | NaN | NaN | 1 | 2013-01-07 | 21237 | 2962 | 1 | 1 | 0 | 0 | 0.548448 |
| 1016261 | 1114 | a | c | 870.0 | NaN | NaN | 0 | NaN | NaN | NaN | 7 | 2013-01-06 | 0 | 0 | 0 | 0 | 0 | 0 | 0.000000 |
| 1016262 | 1114 | a | c | 870.0 | NaN | NaN | 0 | NaN | NaN | NaN | 6 | 2013-01-05 | 18856 | 3065 | 1 | 0 | 0 | 0 | 0.486958 |
| 1016263 | 1114 | a | c | 870.0 | NaN | NaN | 0 | NaN | NaN | NaN | 5 | 2013-01-04 | 18371 | 3036 | 1 | 0 | 0 | 1 | 0.474433 |
| 1016264 | 1114 | a | c | 870.0 | NaN | NaN | 0 | NaN | NaN | NaN | 4 | 2013-01-03 | 18463 | 3211 | 1 | 0 | 0 | 1 | 0.476809 |
| 1016265 | 1114 | a | c | 870.0 | NaN | NaN | 0 | NaN | NaN | NaN | 3 | 2013-01-02 | 20642 | 3401 | 1 | 0 | 0 | 1 | 0.533082 |
| 1016266 | 1114 | a | c | 870.0 | NaN | NaN | 0 | NaN | NaN | NaN | 2 | 2013-01-01 | 0 | 0 | 0 | 0 | a | 1 | 0.000000 |